Systematic Gene Search in the Incyte LifeSeq Database 
Principle of EST Assembly 



~50,000 ESTs per tissue 





Assembly at 0% mismatch 
with GAP4 (Staden) 






Contigs 



Contigs increasing in 
number and length 



Singletons 




Iterative assembly with 
increasing mismatch 
(1%, 2%, 4%) 



5000-6000 contigs 

~25,000 other singletons 




~30,000 consensus sequences per tissue 



FIG. 2a 



09/623746 



~50,000 ESTs 
of a tissue 
(e.g.: uterus tumor) 



GAP4 Assembly 1st Round: 
maximum percent mismatch: 0 
GAP4 Assembly 2nd Round: 
minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 1 
GAP4 Assembly 3rd Round: 
minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 2 
GAP4 Assembly 4th Round: 
minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 2 
GAP4 Assembly 5th Round: 
minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 4 
GAP4 Assembly 6th Round: 
minimum initial match: 20 
maximum pads per read: 8 
maximum percent mismatch: 4 
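
The round schedule above can be illustrated with a small sketch. This is not GAP4 and does not call any Staden package interface; it is a toy greedy overlap merger, written only to show how repeating the assembly with a rising mismatch ceiling lets sequences that stayed separate in a stricter round join in a later one. The names `Round`, `SCHEDULE`, `overlap_merge` and `assemble` are hypothetical. The sketch ignores pads (gaps) entirely, uses a plain minimum overlap length as a loose stand-in for "minimum initial match", and keeps the bases of the left-hand sequence instead of computing a consensus. Parameters that the listing does not state for the 1st round are left unset.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Round:
    """One assembly round; values copied from the listing above."""
    max_mismatch_pct: float
    min_initial_match: Optional[int] = None   # not stated for the 1st round
    max_pads_per_read: Optional[int] = None   # not stated for the 1st round


SCHEDULE = [
    Round(0),
    Round(1, 20, 8),
    Round(2, 20, 8),
    Round(2, 20, 8),
    Round(4, 20, 8),
    Round(4, 20, 8),
]


def overlap_merge(a: str, b: str, min_overlap: int,
                  max_mismatch_pct: float) -> Optional[str]:
    """Join a and b if a suffix of a matches a prefix of b closely enough.

    Tries the longest overlap first. Ungapped comparison only (assumption).
    """
    for k in range(min(len(a), len(b)), min_overlap - 1, -1):
        mismatches = sum(x != y for x, y in zip(a[-k:], b[:k]))
        if 100.0 * mismatches / k <= max_mismatch_pct:
            return a + b[k:]   # keeps a's bases in the overlap, no consensus
    return None


def assemble(seqs: List[str], schedule: List[Round],
             min_overlap: int = 20) -> List[str]:
    """Greedily merge sequences, one pass per round, until nothing joins."""
    pool = list(seqs)
    for rnd in schedule:
        merged = True
        while merged:
            merged = False
            for i in range(len(pool)):
                for j in range(len(pool)):
                    if i == j:
                        continue
                    joined = overlap_merge(pool[i], pool[j], min_overlap,
                                           rnd.max_mismatch_pct)
                    if joined is not None:
                        rest = [s for n, s in enumerate(pool)
                                if n not in (i, j)]
                        pool = rest + [joined]
                        merged = True
                        break
                if merged:
                    break
    return pool


# Two made-up reads whose 25-base overlap differs at one position (4%).
read_a = "ACGTTGCAAGCTTAGGCATCGATCC"
read_b = "ACGTTGCAAGCTAAGGCATCGATCC" + "TGTGACTGAA"

strict = assemble([read_a, read_b], SCHEDULE[:4])   # rounds up to 2%
relaxed = assemble([read_a, read_b], SCHEDULE)      # includes the 4% rounds
```

In this made-up example the two reads remain separate through the rounds at 0%, 1% and 2% and are joined only once the 4% rounds are reached, which mirrors the "contigs increasing in number and length" step of the figure.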
of a specific tissue 
(e.g.: uterus tumor) 
of a specific tissue 
(e.g.: uterus tumor) 
Genes of interest 
expression via electronic 
Hybridization signal 
Sequencing of clones that are located in regions that have 
chromosomal deletions in prostate and breast cancer leads to 
identification of candidate genes 
Confirmation of candidate genes by screening of 
mutations and/or deletions in cancer tissues 